The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Random forests are a class of ensemble methods for classification and regression with randomizing mechanism in bagging instances and selecting feature subspace. For high dimensional data, the performance of random forests degenerates because of the random sampling feature subspace for each node in the construction of decision trees. To address the issue, in this paper, we propose a new Principal Component...
Feature extraction is an essential preprocessing step in machine learning and data mining. Generally, supervised feature extraction algorithms with prior knowledge outperform unsupervised ones without prior knowledge. In particular, nearly all existing supervised feature extraction algorithms employ class labels or pairwise constraints as supervised information. In this paper, we propose to employ...
As the emergement of high-throughput measurement technologies, we are entering the big data era. Modern data are often generated from heterogeneous multiple sources, thus can be called multi-view data. The challenge of effectively integrating such data for decision making and novel knowledge discovery is raised. Matrix factorization methods have historically played important roles in various analyses...
Traditional nonnegative matrix factorization (NMF) is an unsupervised method for linear feature extraction. Recently, NMF with block strategy is shown to be able to extract more sparse and discriminative information of the images. To enhance the discriminative power of NMF, this paper proposes a block kernel nonnegative matrix factorization (BKNMF) based on the kernel theory and block technique. Kernel...
For big, high-dimensional dense features, it is important to learn compact binary codes or compress them for greater memory efficiency. This paper proposes a Binarized Multilinear PCA (BMP) method for this problem with Free-Form Reshaping (FFR) of such features to higher-order tensors, lifting the structure-modelling restriction in traditional tensor models. The reshaped tensors are transformed to...
This paper presents an improvement of the ELMVIS+ method that is proposed for fast nonlinear dimensionality reduction. The ELMVIS++C has an additional supervised learning component compared to ELMVIS+, which is originally an unsupervised method as like the majority of the other dimensionality reduction method. This component prevents samples under the same class being separated apart from each other...
The Liquid State Machine (LSM) exploits the computation capability of recurrent spiking neural networks by incorporating a randomly generated reservoir, which is often fixed. This standard choice relaxes the challenging need for training the complex recurrent reservoir. The fixed reservoir is used as a generic kernel to map the temporal input signals to the internal network dynamics, and a readout...
This paper discusses feature extraction methods. The feature extraction methods such principal component analysis and multiple discriminant analysis are very important techniques in machine learning research areas. The characteristic of feature extraction is to transform the data from a difficultly classified space to a easily classified space. There are many conventional machine learning methods...
Kernel Spectral Clustering (KSC) solves a weighted kernel principal component analysis problem in a primal-dual optimization framework. It builds an unsupervised model on a small subset of data using the dual solution of the optimization problem. This allows KSC to have a powerful out-of-sample extension property leading to good cluster generalization w.r.t. unseen data points. However, in the presence...
In this paper, we present a new and effective dimensionality reduction method called locality sparsity preserving projections (LSPP). Locality preserving projections (LPP) and sparsity preserving projections (SPP) only focus on an aspect of local structure and sparse reconstructive information of the dataset, respectively. The proposed method integrates the sparse reconstructive information and local...
This paper proposes a novel approach that combines specialized pairwise classifiers trained with different feature subsets for facial expression classification. The proposed approach first detects and extracts automatically faces from images. Next, the face is split into several regular zones and textural features are extracted from each zone to capture local information. The features extracted from...
The Johnson-Lindenstrauss (JL) lemma, with known probability, sets a lower bound q0 on the dimension for which a random projection of p-dimensional vector data is guaranteed to be within (1±ε) of being an isometry in a randomly projected downspace. We study several ways to identify a “good” rogue random projection when the target downspace has dimensions below the JL limit. The tools used towards...
This paper develops a framework for determining the Remaining Useful Life (RUL) of aero-engines. The framework includes the following modular components: creating a moving time window, a suitable feature extraction method and a multi-layer neural network as the main machine learning algorithm. The proposed framework is evaluated on the publicly available C-MAPSS dataset. The prognostic accuracy of...
Gait recognition is a rising biometric technology which aims to distinguish people purely through the analysis of the way they walk, while the problem is that the dimensionality of the gait data is too high, so it is necessary to carry on dimensionality reduction task. Up to date, in the area of computer vision and pattern recognition, various dimensionality reduction algorithms have been employed...
A new non-parametric method for reducing the number of dimensions in binary and continuous data, and for measuring the complexity of binary and continuous datasets, is introduced. The method, named Structural Manifold Analysis (SMA), is based on “Generalized Invariance Structure Theory” [1–6], a theory that has been successful in characterizing and accurately predicting human concept learning and...
Tactile sensing has recently attracted significant research interest in robotics. Despite the fact that tactile sensors provide temporal sequences of readings, state-of-the-art material recognition approaches are episodic, i.e. a whole sequence of readings is processed to identify the material. Based on vibration frequency response, this work presents an online identification technique using recursive...
Surveys are used by hospitals to evaluate patient satisfaction and to improve operation. Collected satisfaction data is usually represented to the hospital administration using statistical charts and graphs. Although this statistical data and visualization is helpful, but because of the size and dimension of the dataset, it is very difficult if not impossible, to identify important factors that could...
Intelligent Transportation Systems (ITS) plays a significant role in the traffic management, i.e. traffic jam prediction, route guidance. Due to the hardware failure or data transformation failure, some traffic observation data may be occasionally missed, which seriously affect intelligent transportation information service. So, the completion of traffic observation data has now become an issue that...
We propose an effective subspace selection scheme as a post-processing step to improve results obtained by sparse subspace clustering (SSC). Our method starts by the computation of stable subspaces using a novel random sampling scheme. Thus constructed preliminary subspaces are used to identify the initially incorrectly clustered data points and then to reassign them to more suitable clusters based...
Loop closure detection benefits simultaneous localization and mapping (SLAM) in building a consistent map of the environment by reducing the accumulate error. Handcrafted features have been successfully used in traditional approaches, whereas in this paper, we show that unsupervised features extracted by deep learning models, can improves the accuracy of loop closure detection. In particular, we employ...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.